Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning
نویسندگان
چکیده
In this paper, a novel guidance law based on reinforcement learning (RL) algorithm is presented to deal with the maneuvering target interception problem using deep deterministic policy gradient descent neural network. We take missile’s line-of-sight (LOS) rate as observation of RL and propose reward function, which constructed miss distance LOS train network off-line. process, trained has capacity mapping normal acceleration missile directly, so generate commands in real time. Under actor-critic (AC) framework, we adopt twin-delayed (TD3) by taking minimum value between pair critics reduce overestimation. Simulation results show that proposed TD3-based outperforms current state law, better performance cope continuous action space, also faster convergence speed higher reward. Furthermore, accuracy robustness when intercepting target, converged.
منابع مشابه
A Missile Guidance Law Based on Sontag’s Formula to Intercept Maneuvering Targets
In this paper, we propose a nonlinear guidance law for missiles against maneuvering targets. First, we derive the equations of motion described in the line-of-sight reference frame and then we define the equilibrium subspace of the nonlinear system to guarantee target interception within a finite time. Using Sontag’s formula, we derive a nonlinear guidance law that always delivers the state to ...
متن کاملIntegrated fuzzy guidance law for high maneuvering targets based on proportional navigation guidance
An integrated fuzzy guidance (IFG) law for a surface to air homing missile is introduced. The introduced approach is a modification of the well-known proportional navigation guidance (PNG) law. The IFG law enables the missile to approach a high maneuvering target while trying to minimize control effort as well as miss-distance in a two-stage flight. In the first stage, while the missile is far ...
متن کاملA Comprehensive Approach to Develop a Continuous Fuzzy Guidance Law for Maneuvering Targets
Based on the idea of Continuous Fuzzy Guidance Law (CFGL), a andldquo;three-phase fuzzy guidanceandrdquo; (TFG) law is proposed for the class of surface to air homing missiles. The current approach enables the guidance law to track a maneuvering target from the beginning of the launch phase up to the terminal one while itdynamically attempts to keep miss-distance, flight time and control effort...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملDeep Reinforcement Learning with POMDPs
Recent work has shown that Deep Q-Networks (DQNs) are capable of learning human-level control policies on a variety of different Atari 2600 games [1]. Other work has looked at treating the Atari problem as a partially observable Markov decision process (POMDP) by adding imperfect state information through image flickering [2]. However, these approaches leverage a convolutional network structure...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Aerospace Engineering
سال: 2023
ISSN: ['1687-5966', '1687-5974']
DOI: https://doi.org/10.1155/2023/7924190